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DESCRIPTION 

Digital Signal Processing Method and Learning Method and Devices 
therefor, and Program Storage Medium 

Technical Field 

The present invention relates to a digital signal processing 
method and learning method and devices therefor, and a program 
storage medium, and is suitably applied to a digital signal 
processing method and learning method and devices therefor, and a 
program storage medium in which data interpolation processing is 
performed on digital signals by a rate converter or a PCM (Pulse 
Code Modulation) demodulation device. 



Background Art 

■ Heretofore, oversampling processing, to convert a sampling 
frequency to a value several times higher than the original value 
is performed before a digital audio signal is input to a 
digital/analog converter. With this arrangement, the phase feature 
of an analog anti-aliasing filter keeps the digital audio signal 
outputted from the digital/analog converter, at a constant level 
in the audible high frequency band, and prevents influences of 
digital image noises caused by sampling. 

Typical oversampling processing employs a digital filter of 
the primary linear (straight line) interpolation system. Such 
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digital filter is used for creating linear interpolation data by 
averaging plural pieces of existing data when the sampling rate is 
changed or data is missing. 

Although the digital audio signal subjected to the 
oversampling processing has an amount of data several times more 
than that of the original data in the direction of time-axis 
because of linear interpolation, the freguency band of the digital 
audio signal subjected to the oversampling processing is not 
Changed so much and the sound guality is not improved as compared 
with before. Moreover, since the data interpolated is not 
necessarily created based on the waveform of the analog audio 
signal before it is A/D converted, the waveform reproducibility is 
not improved at all. 

Furthermore, in the case of dubbing digital audio signals 
having different sampling freguencies, the frequencies are 
converted by means of the sampling rate converter. In such cases, 
however, the linear digital filter can interpolate only linear 
data, so that it is difficult to improve the sound quality and 
waveform reproducibility. Furthermore, in the case where data 
samples of digital audio signal are missing, the same results as 
those of the above occurs. 

Description of the Invention 

The present invention has been done considering the above 
points and is to propose a digital signal processing method and 
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learning method and devices therefor, and a program storage medium, 
which are capable of significantly improving the waveform 
reproducibility . 

To obviate such problems, according to the present invention, 
a part is cut out of a digital signal with each of plural windows 
which are different in size to calculate a self correlation 
coefficient, and the parts are classified based on the calculation 
results, that is, the self-correlation coefficients, and then the 
digital signal is converted by a prediction method corresponding 
to this obtained class, so that the digital signal can be more 
suitably converted according to its features. 

Brief Description of the Drawings 

Fig. 1 is a functional block diagram showing the structure 
of an audio signal processing device according to the present 
invention . 

Fig. 2 is a block diagram showing the structure of the audio 
signal processing device according to the present invention. 

Fig. 3 is a flow chart showing an audio data conversion 
processing procedure. 

Fig. 4 is a block diagram showing the structure of a self 
correlation operation unit. 

Fig. 5 is a brief linear diagram illustrating a self 
correlation coefficient judgement method. 

Fig. 6 is a brief linear diagram showing examples of tap 
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cutout . 

Fig. 7 is a brief linear diagram explaining the self 
correlation coefficient judgement method according to another 
embodiment . 

Fig. 8 is a block diagram showing the structure of a 
learning circuit according to the present invention. 

Best Mode for Carrying Out the Invention 

With reference to the accompanying figures one embodiment of 
the present invention will be described. 

Referring to Fig. 1, when the sampling rate of a digital 
audio signal (hereinafter referred to as audio data) is increased 
or the audio data is interpolated, an audio signal processing 
device 10 produces audio data having almost real value by class- 
classification application processing. 

In this connection, audio data in this embodiment may be 
musical data of human being's voice and sounds of musical 
instruments and further, may be data of various other sounds. 

More specifically, in the audio signal processing device 10, 
a self correlation operation unit 11, after cutting out parts of 
input audio data D10 which is input from an input terminal T IN , by 
predetermined time as current data, calculates a self correlation 
coefficient based on each piece of the cut-out current data by a 
self correlation coefficient judgement method, that will be 
described later, and judges a cutting-out range in the time-axis 
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and a phase change based on the calculated self correlation 
coefficient . 

Then, the self correlation operation unit 11 supplies the 
result of judgement on the cutting-out range, in the time-axis, 
which is obtained based on each piece of current data cut out at 
this time, to a variable class-classification sampling unit 12 and 
the variable prediction calculation sampling unit 13 as sampling 
control data Dll, and it supplies the result of the judgement on 
the phase change to a class-classification unit 14 as a 
correlation class D15 expressed by one bit. 

The variable class-classification sampling unit 12 samples 
some pieces of audio waveform data D12 to be classified 
(hereinafter, referred to as class taps) (six samples in this 
embodiment, for example) by cutting the specified ranges out of 
the input audio data D10, which is supplied from the input 
terminal T IN , based on the sampling control data Dll, which is 
supplied from the self correlation operation unit 11, and supplies 
them to the class-classification unit 14. 

The class-classification unit 14 comprises an ADRC (Adaptive 
Dynamic Range Coding) circuit which compresses the class taps D12, 
which are sampled at the variable class-classification sampling 
unit 12, to form a compressed data pattern, and a class code 
generation circuit which obtains a class code to which the class 
taps D12 belongs. 

The ADRC circuit forms pattern compressed data by, for 
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example, compressing each class tap D12 from 8 bits to 2 bits. 
This ADRC circuit conducts the adaptable quantization, and since 
it can effectively expresses the local pattern of the signal level 
with short word length, this ADRC circuit is used for generating a 
code for the class-classification of a signal pattern. 

More specifically, in the case of class-classifying 6 pieces 
of 8-bit data (class taps) , they should be classified into 
enormous number of classes such as 2 48 , thereby increasing the load 
on the circuit. Therefore, in the class-classification unit 14 of 
this embodiment, the class-classification is conducted based on 
the pattern compressed data, which is created at the ADRC circuit 
provided therein. For example, when the one-bit quantization is 
performed on six class taps, the six class taps can be expressed 
by six bits and can be classified to 2 6 = 64 classes. 

At this point, when the dynamic range of class tap is taken 
to be DR; the bit allocation is taken to be m, the data level of 
each class tap to be L; and the quantization code is taken to be Q, 
the ADRC circuit conducts the quantization by evenly dividing data 
between the maximum value MAX and the minimum value MIN into areas 
by the specified bit length, according to the following EQUATION 
(1) . 



DR = MAX 
Q = { (L - 



- MIN + 1 

MIN + 0.5) X 2™ / DR} (1) 
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In the EQUATION (1), { } means that decimal places are 
discarded. Thus, if each of six class taps sampled according to 
the judgement result of the self correlation coefficients 
calculated in the self correlation operation unit 11 is formed of 
eight bits (m = 8), the class tap is compressed to two bits in the 
ADRC circuit. 

Then, where the class taps compressed as described above are 
q n (n = 1 ~ 6) ; the class code generation circuit provided in the 
class-classification unit 14 conducts the arithmetic operation as 
shown in the following EQUATION based on the compressed class taps 
q n , thereby obtaining a class code (class) indicating the class to 
which the class taps (q x ~ q 6 ) belongs. 

class-^q i (2 p ) 1 (2) 

1=1 

At this point, the class code generation circuit integrates 
the correlation class D15 expressed by one bit, which is supplied 
from the self correlation operation unit 11, with the 
corresponding calculated class code (class) . Then the class code 
generation circuit supplies class code data D13 indicating the 
resultant class code (class') to a prediction coefficient memory 
15. This class code (class') indicates a readout address which is 
used in reading out a prediction coefficient from the prediction 
coefficient memory 15. In the EQUATION (2), n represents the 



7 



:i o Cji as : b «-n- .3 o « o .3 ii? 1 o H 1 



number of compressed class taps q n and n = 6 in this embodiment; 
and P represents the bit allocation compressed in the ADRC circuit 
and P = 2 in this embodiment. 

As described above, the class-classification unit 14 
integrates the correlation class D15 with the corresponding class 
code of the class taps D12, which are sampled from the input audio 
data D10 in the variable class-classification sampling unit 12, to 
generate the resultant class code data D13, and supplies this to 
the prediction coefficient memory 15. 

In the prediction coefficient memory 15, sets of prediction 
coefficients corresponding to respective class codes are memorized 
in addresses corresponding to the respective class codes. Then, a 
set of prediction coefficients V} 1 ~ W n memorized in the address 
corresponding to a class code is read out based on the supplied 
class code data D13 from the class-classification unit 14 and is 
supplied to a prediction operation unit 16. 

Furthermore, supplied to the prediction operation unit 16 is 
audio waveform data (hereinafter referred to as prediction taps) 
D14 (X 1 ~ XJ to be prediction-operated, that are cut out and 
sampled based on the sampling control data Dll from the self 
correlation operation unit 11, in the variable prediction 
operation sampling unit 13, in the same manner as the variable 
class-classification sampling unit 12. 

The prediction operation unit 16 conducts a product sum 
operation as shown in the following EQUATION by using the 
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prediction taps D14 (X 1 ~ X n ) , which are supplied from the variable 
prediction operation sampling unit 13, and the prediction 
coefficients W x ~ W n , which are supplied from the prediction 
coefficient memory 15: 

y' = Wj X 1 + W 2 X 2 + + W n X n (3) 

As a result, the prediction result y' is obtained. This prediction 
value y' is sent out from the prediction operation unit 16 as 
audio data D16 with sound quality improved. 

In this connection, the structure of the audio signal 
processing device 10 is shown by the functional blocks described 
above in Fig. 1. And the detailed structure of the functional 
blocks is explained by referring to a device having a computer 
structure as shown in Fig. 2 in this embodiment. More specifically, 
the audio signal processing device 10 comprises a CPU 21, a ROM 
(read only memory) 22, a RAM (random access memory) 15 which is 
the prediction coefficient memory 15 and these circuits are 
connected to each other with a bus BUS. The CPU 21, by executing 
various programs stored in the ROM 22, functions as the functional 
blocks (the self correlation operation unit 11, the variable 
class-classification sampling unit 12, the variable prediction 
operation sampling unit 13, the class-classification unit 14 and 
the prediction operation unit 16) described above in Fig. 1. 

In addition, the audio signal processing device 10 comprises 
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a communication interface 24 for performing communication via a 
network, a removable drive 28 to read out information from an 
external memory medium such as a floppy disk and an optical 
magnetic disk. Also this audio signal processing device 10 can 
read various programs for conducting the class-classification 
adaptive processing as described in Fig. 1, via a network or from 
an external memory medium, in the hard disk of the hard disk 
device 25, in order to perform the class-classification adaptive 
processing according to the read-in programs. 

The user enters a predetermined command via the input means 
2 6 such as the keyboard and the mouse to make the CPU 21 execute 
the class-classification processing described above in Fig. 1. In 
this case, the audio signal processing device 10 enters the audio 
data (input audio data) D10 of which the sound quality should be 
improved, therein via the data input/output unit 27, and after 
applying the class-classification adaptive processing to the input 
audio data D10, it can output the audio data D16 with the sound 
quality improved, to the outside via the data input/output unit 27. 

In this connection, Fig. 3 shows the processing procedure of 
the class-classification adaptive processing in the audio signal 
processing device 10. The audio signal processing device 10 starts 
the processing procedure at step SP101 and at following step SP102, 
calculates a self correlation coefficient of the input audio data 
D10 and based on the calculated self correlation coefficient it 
judges the cutting-out range in the time-axis and the phase change, 
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with the self correlation operation unit 11. 

The judgement result on the cutting-out range in the time- 
axis (i.e., sampling control data Dll) is expressed based on 
whether the feature part and its neighborhood of the input audio 
data D10 has similarity in the roughness of amplitude, and it 
defines a range to cut out the class taps and also defines a range 
to cut out the prediction taps. 

Then, the audio signal processing device 10 moves to step 
SP103, and at the variable class-classification sampling unit 12, 
by cutting the specified range out of the input audio data D10 
according to the judgement result (i.e., sampling control data 
Dll), samples the class taps D12 . Then, the audio signal 
processing device' 10, moving to step SP104, conducts the class- 
classification to the class taps D12 sampled by the variable 
class-classification sampling unit 12. 

Furthermore, the audio signal processing device 10 
integrates the correlation class code obtained as a result of 
judgement on the phase change of the input audio data D10, with 
the class code obtained as a result of class-classification in the 
self correlation operation unit 11. And by utilizing the resulting 
class code, the audio signal processing device 10 reads out a 
prediction coefficients. Prediction coefficients are stored for 
each class by learning in advance. And by reading out the 
prediction coefficients corresponding to the class code, the audio 
signal processing device 10 can use the prediction coefficients 
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matching to the feature of the input audio data D10 at that time. 

The prediction coefficients read out from the prediction 
coefficient memory 15 are used for the prediction operation by the 
prediction operation unit 16 at step SP105. Thus, the input audio 
data D10 is converted to desired audio data D16 by the prediction 
operation suitable for the feature of the input audio data D10. 
Thus, the input audio data D10 is converted to the audio data D16 
of which the sound quality is improved, and the audio signal 
processing device 10, moving to step SP106, terminates the 
processing procedure. 

Next, the self correlation coefficient judgement method of 
the input audio data D10 in the self correlation operation unit 11 
of the audio signal processing device 10 will be explained. 

In Fig. 4, the self correlation operation unit 11 cuts parts 
out of the input audio data D10, which is supplied from the input 
terminal T IN (Fig. 1), at predetermined intervals as current data 
and supplies the current data cut out at this time to self 
correlation coefficient calculation units 40 and 41. 

The self correlation coefficient calculation unit 40 
multiplies the current data cut out, by the Hamming window 
according to the following EQUATION: 

W[k] = 0.54 + 0 . 46*cos ( n *k/N) (k - 0, N-l) ...(4) 

Then, as shown in Fig. 5, the self correlation coefficient 
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calculation unit 40 cuts out search range data AR1 (hereinafter 
referred to as a correlation window (small) ) having the right and 
left sides symmetrical with regard to the target time point 
(current) . 

In this connection, in EQUATION (4), "N" shows the number of 
samples of the correlation windows, and "u" shows the u-th sample 
data. 

Furthermore, the self correlation coefficient calculation 
unit 40 is to select a self correlation operation spectrum set in 
advance, based on the correlation window (small) cut out, so that 
based on the correlation window (small) AR1 cut out at this time, 
it selects, for example, a self correlation operation spectrum SCI. 

R(t) = -3- Sg(i)g(i + 1) , ( 5 ) 



Then, according to the above EQUATION, the self correlation 
coefficient calculation unit 40 multiples the signal waveform g(i) 
formed of N pieces of sampling values by the signal waveform 
g(i+t) delayed by the delay time t, accumulates them and then 
averages the resultant, to calculate the self correlation 
coefficient D40 of the self correlation operation spectrum SCI and 
supplies this to the judgement operation unit 42. 

On the other hand, the self correlation coefficient 
calculation unit 41, by multiplying the current data cut out, by 
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the Hamming window using the same calculation as the EQUATION (4), 
like the self correlation coefficient calculation unit 40, to cut 
out the search range data AR2 (hereinafter referred to as the 
correlation window (large) ) having the right and left sides 
symmetrical with regard to the target time point (current) (Fig. 
5) . 

In this connection, the number of samples "N" used by the 
self correlation coefficient calculation unit 40 in EQUATION (4) 
is set smaller than the number of samples "N" used by the self 
correlation coefficient calculation unit 41 in EQUATION (4). 

Furthermore, out of the self correlation operation spectra 
set in advance, the self correlation coefficient calculation unit 
41 is to select a self correlation operation spectrum in 
correspondence with the self correlation operation spectrum of the 
correlation window (small) cut out and therefor, it selects a self 
correlation operation spectrum SC3 corresponding to the self 
correlation operation spectrum SCI of the correlation window 
(small) AR1 cut out at this moment. Then, the self correlation 
coefficient calculation unit 41 calculates the self correlation 
coefficient D42 of the self correlation operation spectrum SC3 
using the same operation as the above EQUATION (5), and supplies 
this to the judgement operation unit 42. 

The judgement operation unit 42 is to judge the cutting-out 
ranges in the time-axis of the input audio data D10 based on the 
self correlation coefficients supplied from the self correlation 
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coefficient calculation units 40 and 41. And if there exists a big 
difference between the value of the self correlation coefficient 
D40 and the value of the self correlation coefficient D41 supplied 
from the self correlation coefficient calculation units 40 and 41 
respectively, this shows that the condition of audio waveform 
expressed in digital, which is contained in the correlation window 
AR1 and the condition of audio waveform expressed in digital, 
which is contained in the correlation window AR2 are extremely 
different. That is, this shows that audio waveforms of the 
correlation windows AR1 and AR2 are in an abnormal condition with 
no similarity. 

Accordingly, the judgment operation unit 42 judges that it 
is necessary that the size of the class tap and the size of 
prediction tap (cutting-out ranges in the time-axis) should be 
shortened in order to significantly improve the prediction 
operation by finding out the feature of input audio data D10 
inputted at this time. 

Accordingly, the judgement operation unit 42 forms sampling 
control data Dll to cut out the same class tap and prediction tap 
(cutting-out ranges in the time-axis) in size as the correlation 
window (small) AR1, and supplies this to the variable class- 
classification sampling unit 12 (Fig. 1) and the variable 
prediction operation sampling unit 13 (Fig. 1) . 

In this case, in the variable class-classification sampling 
unit 12 (Fig. 1), a short class tap is cut out by the sampling 
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control data Dll as shown in Fig. 6(A), and in the variable 
prediction operation sampling unit 13 (Fig. 1), a short prediction 
tap is cut out in the same size as the class tap by the sampling 
control data Dll as shown in Fig. 6 (C) . 

On the other hand, in the case where there is no big 
difference between the value of the self correlation coefficient 
D40 and the value of the self correlation coefficient D41 supplied 
from the self correlation coefficient calculation units 40 and 41 
respectively, this shows that the condition of audio waveform 
expressed in digital, which is contained in the correlation window 
AR1 and the condition of audio waveform expressed in digital, 
which is contained in the correlation window AR2 are not different 
extremely, i.e., this shows that the audio waveforms are in the 
normal conditions with similarity. 

In this case, the judgment operation unit 42 judges that it 
is capable of finding out the feature of the input audio data D10 
and is capable of conducting the prediction calculation even when 
the sizes of the class tap and the prediction tap (cutting-out 
ranges in the time-axis) are made longer. 

Thus, the judgement operation unit 42 generates sampling 
control data Dll to cut out the same class tap and prediction tap 
(cutting-out ranges in the time-axis) in size as the correlation 
window (large) AR2, and supplies this to the variable class- 
classification sampling unit 12 (Fig. 1) and the variable 
prediction operation sampling unit 13 (Fig. 1) . 
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In this case, in the variable class-classification sampling 
unit 12 (Fig. 1), a long class tap is cut out based on the 
sampling control data Dll as shown in Fig. 6 (B) . And the variable 
prediction operation sampling unit 13 (Fig. 1) cuts out the same 
prediction tap in size as the class tap, based on the sampling 
control data Dll as shown in Fig. 6 (D) . 

Furthermore, the judgement operation unit 42 is to conduct 
the judgement of phase change of the input audio data D10 based on 
self correlation coefficients supplied from the self correlation 
coefficient calculation units 40 and 41. And at this moment, if 
the big difference exists between the value of the self 
correlation coefficient D40 and the value of the self correlation 
coefficient D41 supplied from the self correlation coefficient 
calculation units 40 and 41 respectively, this means that audio 
waveforms are in the abnormal condition with no similarity, then 
the judgement operation unit 42 raises the correlation class D15 
expressed by one bit (i.e., makes it to "1") and supplies this to 
the class-classification unit 14. 

On the other hand, if there is no big different between the 
value of self correlation coefficient D40 and the value of self 
correlation coefficient D41 supplied from the self correlation 
coefficient calculation units 40 and 41, this means that audio 
waveforms are in the normal condition with similarity. Hence, the 
judgement operation unit 42 does not raise the correlation class 
D15 expressed by one bit (i.e., "0") and supplies this to the 
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class-classification unit 14. 

Accordingly, when audio waveforms of the correlation windows 
AR1 and AR2 are in the abnormal conditions with no similarity, the 
self correlation operation unit 11 generates the sampling control 
data Dll to cut out short taps in order to improve the prediction 
operation by finding out the features of the input audio data D10. 
And when audio waveforms of the correlation windows ARl and AR2 
are in the normal state with similarity, the self correlation 
operation unit 11 can generate the sampling control data Dll to 
cut out long taps.' 

Furthermore, if audio waveforms of correlation windows ARl 
and AR2 are in the abnormal state with no similarity, the self 
correlation operation unit 11 raises the correlation class D15 
expressed by one bit (i.e., makes it to "1") and on the other hand, 
when the waveforms of the correlation windows ARl and AR2 are in 
the normal state with similarity, the self correlation operation 
unit 11 does not raise the correlation class D15 expressed by 1 
bit (i.e., "0"), then it supplies the correlation class D15 to the 
class-classification unit 14. 

In this case, the audio signal processing device 10 
integrates the correlation class D15 supplied from the self 
correlation operation unit 11 with the class code (class) obtained 
as a result of class-classification of the class taps D12 supplied 
from the variable classification sampling unit 12 at that time, it 
can conduct the prediction operation by more freguent class- 
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classification. And thus, the audio signal processing device 10 
can generate the audio data of which the audio quality is 
significantly improved. 

In this connection, the present embodiment has described the 
case where each of the self correlation coefficient calculation 
units 40 and 41 selects one self correlation operation spectrum. 
The present invention, however, is not only limited to this but 
also a plurality of self correlation operation spectra may be 
selected . 

In this case, when the self correlation coefficient 
calculation unit 40 (Fig. 4) selects preset self correlation 
operation spectra based on the correlation window (small) AR3 cut 
out at that time, it selects self correlation operation spectra 
SC3 and SC4 as shown in Fig. 7, and calculates self correlation 
coefficients of the selected self correlation operation spectra 
SC3 and SC4 by the same arithmetic operation as that of EQUATION 
(5) described above. Furthermore, the self correlation coefficient 
calculation unit 40 (Fig. 4), by averaging the self function 
coefficients of the self correlation operation spectra SD3 and SC4 
calculated respectively, supplies the newly calculated self 
function coefficient to the judgement operation unit 42 (Fig. 4). 

On the other hand, the self correlation coefficient 
calculation unit 41 (Fig. 4) selects self correlation operation 
spectra SC5 and SC6 corresponding to the self correlation 
operation spectra SC3 and SC4 of the correlation window (small) 
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AR3 cut out at that time, and calculates self correlation 
coefficients of the selected self correlation operation spectra 
SC5, SC6 by the same arithmetic operation as that of the EQUATION 
(5) described above. Moreover, the self correlation coefficient 
calculation unit 41 (Fig. 4), by averaging the self function 
coefficients of the self correlation operation spectra SC5 and SC6, 
supplies the newly calculated self function coefficient to the 
judgement operation unit 42 (Fig. 4). 

When each self correlation coefficient calculation unit 
selects multiple self correlation operation spectra as described 
above, it secures wider self correlation operation spectra. Thus, 
the self correlation coefficient calculation unit can calculate a 
self correlation coefficient using more samples. 

Next, a learning circuit for obtaining a set of prediction 
coefficients for each class to be memorized in the prediction 
coefficient memory 15, which is described in Fig. 1, by learning 
in advance will be explained. 

In Fig. 8, the learning circuit 30 receives teacher audio 
data D30 with high sound quality at a student signal generating 
filter 37. The student signal generating filter 37 thins out the 
teacher audio data D30 at the thinning rate set by a thinning rate 
setting signal D39, at predetermined intervals for the 
predetermined samples. 

In this case, prediction coefficients to be obtained are 
different depending upon the thinning rate in the student signal 
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generating filter 37, and audio data to be reformed by the audio 
signal processing device 10 differ accordingly. For example, in 
the case of improving the sound quality of audio data by 
increasing the sampling frequency in the audio signal processing 
device 10, the student signal generating filter 37 conducts the 
thinning processing to decrease the sampling frequency. On the 
other hand, when the audio signal processing device 10 improves 
the sound quality by supplementing data samples dropped out of the 
input audio data D10, the student signal generating filter 37 
conducts the thinning processing to drop out data samples. 

Thus, the student signal generating filter 37 generates the 
student audio data D37 through the predetermined thinning 
processing from the teacher audio data D30, and supplies this to 
the self correlation operation unit 31, the variable class- 
classification sampling unit 32 and the variable prediction 
operation sampling unit 33. 

The self correlation operation unit 31, after dividing the 
student audio data D37, which is supplied from the student signal 
generating filter 37, into ranges at predetermined intervals (for 
example, by six samples in this embodiment) , calculates the self 
correlation coefficient of the waveform of each time-range 
obtained by the self correlation coefficient judgement method 
described above in Fig. 4. And based on the self correlation 
coefficient calculated, the self correlation operation unit 31 
judges the cutting-out range in the time-axis and the phase change. 
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Based on the self correlation coefficient of the student 
audio data D37 calculated at this time, the self correlation 
operation unit 31 supplies the judgement result on the cutting-out 
range in the time-axis to the variable class-classification 
sampling unit 32 and the variable prediction operation' sampling 
unit 33 as sampling control data D31, and simultaneously, it 
supplies the judgement result of the phase change to the class- 
classification unit 14 as correlation data D35. 

Furthermore, the variable class-classification sampling unit 
32, by cutting the specified range out of the student audio data 
D37 supplied from the student signal generating filter 37, based 
on the sampling control data D31 supplied from the self 
correlation operation unit 31, samples class taps D32 to be class- 
classified (in this embodiment, six samples for example) and 
supplies this to the class-classification unit 34. 

The class-classification unit 34 comprises an ADRC (Adaptive 
Dynamic Range Coding) circuit to form a compressed data pattern 
upon compressing the class taps D32 sampled in the variable class- 
classification sampling unit 32 and a class code generation 
circuit to generate a class code to which the class taps D32 
belongs . 

The ADRC circuit, by conducting the operation to compress 
each class tap D32 from 8 bits to 2 bits, forms pattern compressed 
data. This ADRC circuit is a circuit to conduct the adaptable 
guantization . Since this circuit can effectively express a local 



22 



.1, 01 -O e H • 3 O ,, 10 '3 icr if 3 C3i id' 



pattern of the signal level with a short word length, it is used 
for generating a code for the class-classification of the signal 
pattern. 

More specifically, in the case of class-classifying 6 pieces 
of 8-bit data (class tap) , it is necessary to classify them into 
enormous numbers of classes such as 2 48 , thereby increasing the 
load on the circuit. This class-classification unit 34 of this 
embodiment performs the class-classification based on the pattern 
compressed data which is formed in the ADRC circuit provided 
therein. For example, if the 1-bit quantization is executed to 6 
class taps, the 6 class taps can be expressed by 6 bits and 
classified into 2 6 = 64 classes. 

At this point, if the dynamic range of the class tap is 
taken to be DR, the bit allocation is m, the data level of each 
class tap is L, and the quantization code is Q, the ADRC circuit 
conducts the quantization by evenly dividing the range between the 
maximum value MAX and the minimum value MIN by the specified bit 
length, according to the same arithmetic operation as that of 
EQUATION (1) described above. Accordingly, if each of 6 class taps 
sampled according to the judgement result of self correlation 
coefficients (sampling control data D31) calculated in the self 
correlation operation unit 31 is formed of 8 bits (m = 8) for 
example, the class tap is compressed to 2 bits respectively in the 
ADRC circuit. 

If thus compressed class taps are taken to be q n (n = 1 ~6) 
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respectively, the class code generation circuit provided in the 
class-classification unit 34 executes the same arithmetic 
operation as that of the EQUATION (2) described above based on the 
compressed class tap q n , and calculates a class code (class) 
showing a class to which that class taps (q x ~ q 6 ) belong. 

At this point, the class code generation circuit integrates 
the correlation data D35 supplied from the self correlation 
operation unit 31 with the corresponding class code (class) 
calculated, and supplies the class code data D34 showing the 
resulting class code (class') to the prediction coefficient memory 
15. This class code (class') shows the readout address which is 
used when prediction coefficients are read out from the prediction 
coefficient memory 15. In this connection, in the EQUATION (2), n 
represents the number of compressed class taps q n and n = 6 in this 
embodiment. Moreover, P is a bit allocation compressed in the ADRC 
circuit and P = 2 in this embodiment. 

With this arrangement, the class-classification unit 34 
integrates the correlation data D35 with the corresponding class 
code of the class taps D32 sampled from the student audio data D37 
in the variable class-classification sampling unit 32, and forms 
the resultant class code data D34 and supplies this to the 
prediction coefficient memory 15. 

Furthermore, the prediction taps D33 {X 1 ~ X n ) cut out and 
sampled and to be used for the prediction operation, similar to 
the variable class-classification sampling unit 32, based on the 
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sampling control data D31 from the self correlation operation unit 
31, in the variable prediction computing sampling unit 33 are 
supplied to the prediction coefficient calculation unit 36. 

The prediction coefficient calculation unit 36 forms a 
normal equation by using the class code data D34 (class code 
class') supplied from the class-classification unit 34, prediction 
taps D33 and the teacher audio data D30 with high sound quality 
supplied from the input terminal T IN . 

More specifically, where levels of n samples of the student 

audio data D37 are taken to be x w x 2 , , x n respectively, and 

the quantization data as a result if p bits of ADRC are taken to 

be q x/ , q n . At this point, the class code (class) of this 

range is defined as the Equation (2) described above. Then, where 

levels of the student audio data D37 are taken to be x ir x 2 , , 

x n respectively, and the level of teacher audio data D30 with the 
high sound quality is taken to be y, the linear estimation 
equation of n tap according to the prediction coefficients w lx w 2 , 
/ w n is set for each class code as follows: 

y - V^Xj + W 2 X 2 + . . . + W n X n (6) 

In this connection, the coefficient W n is unknown prior to 
learning . 

The learning circuit 30 learns multiple audio data for each 
class code. When the number of data samples is M, the following 
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Equation is set according to EQUATION (6) . 

y = W,X kl + W 2 X k2 + . . .W n X kn (7) 

Provided that k = 1, 2, M. 

When M > n, prediction coefficients w lf w n are not 

decided uniquely. Therefore, elements of the error vector are 
defined as follows: 

e k = y k - {W x X kl + W 2 X k2 + ... W n X kn } (8) 

Provided that k = 1, 2, , M. Then, the prediction coefficient 

is obtained so that the following EQUATION (9) is the minimum. 
That is, the minimum square method is used. 



e 2 =2> 2 k (9) 

k=0 



At this point, the deviated differential coefficient of w n is 
obtained according to EQUATION (9) . In this case, each W n (n = 1 ~ 
6) may be obtained so that the following EQUATION (10) becomes to 
"0" 



<9e 2 M [ de 1 M M 

^— = Z 2 TT^ e * =Z 2x *. " e * =Z 2x k i- e * (i = 1, 2, ...n) (10) 

dwi k=0 |_dwij k=0 ~ 
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Then, if X tj and Y ± would be defined as following EQUATIONS, 



M 

x u =Z x pi-- x p 3 (id 

p=0 



M 

Y x=2> kl -y k (12) 

k=0 



the EQUATION (10) is expressed as follows, by using the matrix: 
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x 22 
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•- x mn 


w n 




Y n 



(13) 



This equation is generally called as the normal equation. 
In this connection, n = 6. 

After all learning data (the teacher audio data D30, class 
code "class", prediction tap D33) are input, the prediction 
coefficient calculation unit 36 creates the normal equation shown 
in EQUATION (13) described above for each class code "class", and 
by using the general matrix method such as the sweeping out method, 
to obtain each W n , and calculates prediction coefficients for each 
class code. The prediction coefficient calculation unit 36 writes 
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the obtained prediction coefficients (D36) in the prediction 
coefficient memory 15. 

As a result of such learning, prediction coefficients to 
assume the high sound quality audio data y for each pattern to be 

regulated by the quantization data q ir , q 6 are stored for each 

class code in the prediction coefficient memory 15. This 
prediction coefficient memory 15 is used in the audio signal 
processing device 10 described above in Fig. 1. By this processing, 
learning of prediction coefficients for generating the audio data 
with high sound quality from the normal audio data according to 
the linear estimation formula is terminated. 

Accordingly, in the learning circuit 30, the student signal 
generating filter 37 conducts the thinning processing of teacher 
audio data with high sound quality, taking the interpolation 
processing in the audio signal processing device 10 into 
consideration, thereby obtaining the prediction coefficients for 
the interpolation processing in the audio signal processing device 
10 . 

According to the foregoing structure, the audio signal 
processing device 10 calculates the self correlation coefficient 
in the time waveform range of the input audio data D10 with the 
self correlation operation unit 11. The judgement result by the 
self correlation operation unit 11 varies according to the sound 
quality of the input audio data D10. And the audio signal 
processing device 10 specifies the class based on the judgement 
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result of the self correlation coefficients of the input audio 
data D10. 

The audio signal processing device 10 obtains prediction 
coefficients to obtain audio data without deviation and with high 
sound quality (teacher audio data), for each class in advance in 
learning, and conducts the prediction calculation on input audio 
data D10 class-classified based on the judgement result of the 
self correlation coefficients, by the prediction coefficients 
corresponding to that class. Thus, the input audio data D10 is 
prediction-operated using the prediction coefficients 
corresponding that sound quality, so that the sound quality is 
improved to the degree sufficient for practical use. 

Furthermore, at the time of learning for obtaining 
prediction coefficients for each class, by obtaining the 
prediction coefficients corresponding numerous pieces of teacher 
audio data with different phases, even if the phase change occurs 
during the class-classification adaptive processing of the input 
audio data D10 in the audio signal processing device 10, the 
processing corresponding to the phase change can be conducted. 

According to the foregoing structure, since the input audio 
data D10 is class-classified based on the judgement result of self 
correlation coefficients in the time waveform range of the input 
audio data D10 and the input audio data D10 is prediction-operated 
utilizing the prediction coefficients based on the result of the 
class-classification, the input audio data D10 can be converted to 
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the audio data D16 with much higher sound quality. 

The embodiment described above has described the case where 
the self correlation operation units 11 and 31 calculates the self 
correlation coefficients by conducting the arithmetic operation 
according to the EQUATION (5) using the time-axis waveform data 
(the self operation spectrum SCI selected based on the correlation 
window (small) and the self operation spectrum SC2 selected from 
the correlation window (large) corresponding to the self operation 
spectrum SCI). The present invention, however, is not only limited 
to this but also self correlation coefficients may be calculated, 
by calculating conversion data according to EQUATION (5) after 
converting the inclined polarity to the data expressed as the 
feature vector focusing attention onto the inclined polarity of 
time-axis waveform. 

In this case, since the amplitude element of the conversion 
data which is obtained by conversion so as to express the inclined 
polarity of the time-axis waveform as the feature vector is 
eliminated, the self correlation coefficient calculated according 
to the EQUATION (5) is obtained as a value which does not depend 
on the amplitude. Accordingly, a self correlation operation unit 
for computing the conversion data according to EQUATION (5) can 
obtain self correlation coefficient which further depends on the 
frequency element. 

As described above, if the conversion data, which is 
obtained by conversion, is computed according to the EQUATION (5) 
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after converting the inclined polarity to the data expressed as 
the feature vector focusing attention onto the inclined polarity 
of the time-axis waveform, the self correlation coefficient which 
further depends on the frequency element can be obtained. 

Furthermore, the embodiment described above has described 
the case of expressing, by one bit, the correlation class D15 
which is the result of the judgement of phase change conducted by 
the self correlation operation units 11 and 13. However, the 
present invention is not only limited to this but also this can be 
expressed by multi bits. 

In this case, the judgement operation unit 42 of the self 
correlation operation unit 11 (Fig. 4) forms the correlation class 
D15 expressed by multi bits (quantization) according to the 
differential value between the value of self correlation 
coefficient D40 and the value of self correlation coefficient D41 
supplied from the self correlation coefficient calculating units 
40 and 41 and supplies this to the class-classification unit 14. 

Then; the class-classification unit 14 conducts the pattern 
compression onto the correlation class D15 expressed by multi bits 
supplied from the self correlation operation unit 11 in the ADRC 
circuit described- above in Fig. 1, and calculates the class code 
(class 2) indicating the class to which the correlation class D15 
belongs. Moreover, the class-classification unit 14 integrates the 
class code (class 2) calculated with respect to the correlation 
class D15 with the class code (class 1) calculated with respect to 
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the class tap D12 supplied from the variable class-classification 
sampling unit 12, and supplies the resultant class code data 
indicating the class code (class 3) to the prediction coefficient 
memory 15. 

Furthermore, the self correlation operation unit 31 of the 
learning circuit for memorizing a set of prediction coefficients 
corresponding to the class code (class 3) forms the correlation 
class D35 expressed by multi bits (quantization), as in the case 
of the self correlation operation unit 11, and supplies this to 
the class-classification unit 34. 

Then, the class-classification unit 34 pattern-compresses 
the correlation class D35 expressed by multi bits supplied from 
the self correlation operation unit 31, in the ADRC circuit 
described above in Fig. 8, and calculates the class code (class 5) 
indicating the class to which the correlation classes D35 belongs. 
Moreover, at this moment, the class-classification unit 34 
integrates the class code (class 5) calculated on the correlation 
classes D35 with the class code (class 4) calculated on the class 
taps D32 supplied from the variable class-classification sampling 
unit 32, and supplies the class code data indicating the resultant 
class code (class 6) to the prediction coefficient calculation 
unit 36. 

With this arrangement, the correlation class that is the 
result of judgement of phase change conducted by the self 
correlation computing unit 11, 31 can be expressed by multi bits. 
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And thus the frequency of class-classification can be further 
increased. Accordingly, the audio signal processing device which 
conducts the prediction calculation of the input audio data by 
using the prediction coefficients based on a result of class- 
classification can convert audio data to audio data with much 
higher sound quality. 

Furthermore, the embodiment described above has dealt with 
the case of carrying out multiplication by using the Hamming 
window as the window function. The present invention, however, is 
not only limited to this but also by using another window function 
such as the Blackman window in place of the Hamming window, the 
multiplication may be conducted. 

Furthermore, the embodiment described above has dealt with 
the case of using the primary linear method as the prediction 
system. The present invention, however, is not only limited to 
this but also, in short, the result of learning may be used, such 
as the method by multi-dimensional function. In the case where 
digital data supplied from the input terminal T IN is image data, 
various prediction systems, such as the method to predict from the 
pixel value itself can be applied. 

Furthermore, the embodiment described above has dealt with 
the case of conducting the ADRC as the pattern forming means to 
form a compressed data pattern. The present invention, however, is 
not only limited to this but also the compression means such as 
the differential pulse code modulation (DPCM) and the vector 
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quantization (VQ) may be used. In short, if information 
compression means can express the signal waveform pattern with 
small number of classes, it may be acceptable. 

Moreover, the embodiment described above has dealt with the 
case where the audio signal processing device (Fig. 2) executes 
the audio data conversion processing procedure according to the 
programs. The present invention, however, is not only limited to 
this but also such functions may be realized by the hardware 
structure and installed in various digital signal processing 
devices (such as a rate converter, an oversampling processing 
device, a PCM (Pulse Code Modulation) to be used for the BS 
(Broadcasting Satellite)), or by loading these programs from a 
program storage medium (floppy disk, optical disc, etc.) in which 
programs to realize various functions are stored, into various 
digital signal processing devices, these function units may be 
realized. 

According to the present invention as described above, parts 
are cut out of the digital signal by multiple windows having 
different sizes to calculate respective self correlation 
coefficients, and the parts are classified based on the 
calculation results of self correlation coefficients and then, the 
digital signal is converted according to the prediction system 
corresponding to the obtained class, so that the conversion 
suitable for the features of digital signal can be conducted. Thus, 
the conversion to the high quality digital signal having further 
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improved waveform reproducibility can be realized. 

Industrial Utilization 

The present invention can be utilized for a rate converter, 
a PCM decoding device and an audio signal processing device which 
perform data interpolation processing on digital signals. 
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